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SPECIFICATION 

TITLE OF THE INVENTION 

5 INFORMATION PROCESSING METHOD AND APPARATUS 

FIELD OF THE INVENTION 
The present invention relates to management of 
metadata to be attached to binary data, and more 
10 particularly, to information processing method and 

apparatus which enable high-speed access to metadata. 

BACKGROUND OF THE INVENTION 
Metadata is "data on data" used as explanation of 
15 binary data such as image data and sound data. A search 
for image data, sound data and the like can be made by 
metadata set for respective binary data. Now the 
usefulness of the metadata is widely known. Various 
forms of metadata are prepared for binary data, and 
20 attempts to utilize them in searching by using data base 
are made. 

As binary data search methods, a method of 
generating data base related to binary data as search 
subjects and a method of performing a search by reading 
25 metadata at whatever time are known. In the former case, 
a search can be made at a high speed when an enormous 
number of search subject data are prepared. However, the 
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method cannot easily handle changes in situation due to 
addition and/or deletion of binary data. Accordingly, 
further proposed is a method having flexibly to changes 
in situation due to addition and/or deletion of binary 

5 data by describing binary data and metadata in the same 
file and using the latter search method. 

However, in the method of describing binary data 
and metadata in the same file, when an enormous amount 
of search subject data are provided, the speed of search 

10 processing is extremely lowered since a search must be 
made by reading files each including binary data and 
metadata and extracting the metadata. Especially, when a 
search is made for binary data stored in a storage 
medium with a low access speed such as a magneto-optic 

15 disk (MO) , the speed of search processing is seriously 
reduced. 

SUMMARY OF THE INVENTION 

20 The present invention has been made in 

consideration of the above problems, and has its object 
to enable high-speed access to metadata of binary data 
as a search subject. 

Further, in case of metadata of independent-format 
25 binary data, significant data cannot be extracted unless 
the data is read in accordance with the format. 
Accordingly, another object of the present invention is 
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to solve the problem and enable flexible access to 
internal data by using metadata described in highly 
versatile data description language. 

According to the present invention, the foregoing 
5 object is attained by providing an information 

processing method for storing binary data and metadata 
related to binary data into a storage medium, comprising 
an allocation step of allocating a first storage area 
for metadata in advance on the storage medium, a first 
10 storage step of allocating a metadata storage area for 
storing metadata from the first storage area allocated 
at the allocation step, and storing metadata into the 
metadata storage area, a second storage step of storing 
binary data related to metadata into a second storage 
15 area other than the first storage area on the storage 
medium, and a third storage step of storing link 
information that links metadata stored in the first 
storage area with binary data stored in the second 
storage area, in correspondence with metadata, into the 
20 first storage area, wherein at third storage step, the 
link information is stored into an area adjacent to an 
area where metadata is stored. 

Further, the foregoing object is attained by 
providing an information processing apparatus for 
25 storing binary data and metadata related to the binary 
data into a storage medium, comprising allocation means 
for allocating a first storage area for metadata in 



advance on the storage medium, first storage means for 
allocating a metadata storage area for storing metadata 
from the first storage area allocated by the allocation 
means, and storing metadata into the metadata storage 
5 area, second storage means storing binary data related 
to metadata into a second storage area other than the 
first storage area on the storage medium, and third 
storage means for storing link information that links 
metadata stored in the first storage area with binary 

10 data stored in the second storage area, in 

correspondence with metadata, into the first storage 
area, wherein the third storage means stores the link 
information into an area adjacent to an area where 
metadata is stored. 

15 Other features and advantages of the present 

invention will be apparent from the following 
description taken in conjunction with the accompanying 
drawings, in which like reference characters designate 
the same name or similar parts throughout the figures 

20 thereof. 



BRIEF DESCRIPTION OF THE DRAWINGS 



The accompanying drawings, which are incorporated 
25 in and constitute a part of the specification, 

illustrate embodiments of the invention and, together 
with the description, serve to explain the principles of 
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the invention. 

Fig. 1 is a block diagram showing an example of 
system configuration according to a first embodiment of 
the present invention ; 
5 Fig. 2 is a schematic diagram showing the 

structure of binary data with metadata and form of data 
storage into a storage medium; 

Fig. 3 is a table showing an example of metadata 
managed in accordance with the first embodiment; 
10 Fig. 4 is an example of XML description of the 

metadata in Fig. 3; 

Fig. 5 is a perspective view explaining a metadata 
storage area according to the first embodiment; 

Fig. 6 is a schematic diagram showing the storage 
15 area of a disk when a metadata storage area is allocated 
by generating an area file; 

Fig. 7 is a flowchart showing file storage 
processing according to the first embodiment; 

Fig. 8 is a flowchart showing processing to 
20 allocate an area for storing metadata from the metadata 
storage area and store the metadata into the area; 

Fig. 9 is a flowchart showing processing to 
establish linkage between binary data and metadata; 

Fig. 10 is a schematic view explaining the status 
25 of file stored in the storage medium according to the 
first embodiment; and 

Fig. 11 is an example of metadata description 



according to a third embodiment of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

5 Preferred embodiments of the present invention 

will now be described in detail in accordance with the 
accompanying drawings . 
<First Embodiment> 

As a first embodiment, information processing 
10 apparatus and method for high-speed access to metadata 
in a case where binary data and metadata are stored in 
the same file will be described. 

Fig. 1 is a block diagram showing an example of 
system configuration according to the first embodiment. 
15 In Fig. 1, a data reading unit 101 which reads data 

includes devices such as a scanner. A data input unit 
102 which inputs an instruction from a user or data 
includes pointing devices such as a keyboard and a mouse. 
A storage unit 103, which is a device for storing a 
20 control program and the like, is generally a hard disk 
or the like. A display unit 104, which displays a GUI 
image and the like, is generally a CRT, or a liquid 
crystal display. 

A CPU 105 relates to all the processings in the 
25 above elements. A ROM 106 and a RAM 107 provide a 

program, data, a work area and the like necessary for 
processing to the CPU 105. Further, a control programs 
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necessary for all the processings is stored in the 
storage unit 103 or the ROM 106. In a case where the 
control program to be executed is stored in the storage 
unit 103, the program is temporarily read (loaded) onto 

5 the RAM 107 and then executed by the CPU 105. A 

recording unit 108, which is a device for storing binary 
data and metadata, is an MO, DVD-RAM or the like. 

Regarding the system configuration, various 
constituent elements other than the above elements may 

10 be provided and various modifications may be made to the 
system, however, such matter is not the principal object 
of the present invention, therefore the explanation 
thereof will be omitted. 

Hereinbelow, first, the structure of file in which 

15 binary data and metadata are stored will be described, 
then a metadata storage area in which the metadata is 
stored will be described, then processing to acquire the 
metadata storage area will be described, and finally, 
processing to save the file where the binary data and 

20 the metadata are stored will be described. 

(File Structure) 

Fig. 2 is a schematic diagram showing the 
structure of binary data with metadata and form of data 
25 storage into a storage medium. In the present embodiment, 
binary data is a still image data in DCF (Design Rule 
for Camera File System) (a unified digital camera general 



recording format) basic file format. In Fig. 2, the 
binary data and the metadata construct one file, however, 
they are stored in different areas on a storage medium 
corresponding to the recording unit 108 in Fig. 1. That 

5 is, the metadata is stored into a storage area for 

metadata {hereinbelow, metadata storage area) , while the 
binary data is stored into an area other than the 
metadata storage area (hereinbelow, general area) . 
Further, as a file structure, metadata is attached to 

10 the end of binary data. In this manner, as a binary file 
and metadata are stored as the same file, linkage can be 
easily made between the binary file and the metadata 
upon file movement or deletion. 

Fig. 3 is a table showing an example of metadata 

15 managed in accordance with the first embodiment. The 

table shows metadata attached to still image binary data. 
As a format of metadata attached to a still image, 
expression of pair of data attribute and data value as 
shown in Fig. 3 can be given. 

20 In the example of Fig. 3, five attributes of 

metadata, "PhotoGrapher" , "Date", "Location", "Event", 
"Keyword" are shown, and as respective data values, the 
name of photographer is described for the attribute 
"PhotoGrapher"; the date of photo shooting, for the 

25 attribute "Date"; the location of photo shooting, for 
the attribute "Location"; the name of event, for the 
attribute "Event"; and the name of subject, for the 



attribute "Keyword" . 

Metadata may be stored as text data, however, in 
the present embodiment, the stored metadata is described 
in data description language XML. Fig. 4 is an example 
5 of XML description of metadata in Fig. 3. First, a tag 
representing the start of XML is described, then a tag 
representing the metadata is described. For example, in 
the present embodiment, data described by enclosing the 
metadata in Fig. 3 with a start tag <PHOTO> and an end 

10 tag </PHOTO> is metadata. Further, in the present 

embodiment, in each metadata, attribute strl and data 
str2 are described as 

<ITEM ATTR = "strl"> str2 </ITEM>. 
By this metadata description in XML, highly flexible 

15 data description can be made. 

(Metadata Storage Area) 

Fig. 5 is a perspective view explaining a metadata 
storage area according to the first embodiment. In the 

20 present embodiment, a storage medium corresponding to 

the recording unit 108 is an MO, and a logical format of 
MO disk is UDF (Universal Disk Format) . In use of UDF 
file management system, file divisional storage is 
possible. For example, in the present embodiment, file 

25 divisional storage as shown in Fig. 2 is realized by UDF. 
As shown in Fig. 5, the MO disk has a case 501 
accommodating a disk 502. On the disk 502, a metadata 



storage area 503 is allocated for storing metadata by 
metadata area allocation processing to be described 
below, and the area is discriminated from a general area 
504. Preferably the metadata storage area 503 is a 

5 continuous area. In a case where the storage area 503 is 
a continuous area, when metadata is referred to in 
search processing or the like, the processing can be 
made at a high speed. Further, as shown in Fig. 5, it is 
preferable that the metadata storage area 503 is 

10 allocated in the central portion of the disk 502 which 

can be accessed at a high speed. In this case, the speed 
of access to metadata is further improved. 

(Metadata Allocation Processing) 

15 Next, the metadata area allocation processing will 

be described. In UDF, a disk area of particular 
directory can be allocated in advance only for use of 
file under the directory. However, in the present 
embodiment, the metadata storage area is allocated by 

20 generating an "area file" having a file size of metadata 
storage area. The name of directory for which the 
metadata area is allocated is designated when the area 
is allocated by using the function of UDF. Accordingly, 
the metadata area is assigned to the directory at the 

25 same time when it is allocated. The allocation of 
metadata storage area is performed by the user's 
instruction after disk initialization. There is no 



problem if the area allocation is automatically 
performed after the initialization by using a 
specialized driver or the like. In this manner, the 
allocation of metadata storage area by generating an 

5 area file is effective in a file system which lacks 

means for allocating a disk area in advance. Note that 
in this case, the allocation of metadata storage area 
may be made within a directory for storing binary data, 
or in other directories than the above directory. 

10 Fig. 6 is a schematic diagram showing the storage 

area of a disk when a metadata storage area is allocated 
by generating an area file. As shown in Fig. 6, the 
metadata area 503 is allocated by generating an area 
file having the size of metadata storage area on the 

15 disk. When metadata is stored by file storage processing 
to be described later, the size of the area file is 
reduced in correspondence with the amount of the 
metadata. That is, the sum of the size of stored 
metadata and the size of the area file is always equal 

20 to the size of metadata storage area 503 (See Fig. 10) . 

Note that it is desirable that the area file is 
prevented from being erroneously deleted by setting a 
file attribute to invisible or setting write protection. 



25 {Processing to Save File Including Binary Data and 
Metadata) 

Next, a procedure for saving a file, having the 
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structure as shown in Fig. 2, on a storage medium where 
the metadata storage area is allocated as above will be 
described. Fig. 7 is a flowchart showing file storage 
processing according to the first embodiment. 
5 First, at step S600, it is determined whether or 

not a save subject file is a file including metadata and 
binary data. In the present embodiment, it is determined 
whether metadata is included in a file by extracting the 
last 8 bytes of the file and examining whether or not 
10 the 8 bytes correspond with "</PH0T0>". If the save 
subject file does not include metadata, the process 
proceeds to step S605, at which the file is saved in a 
general area, and the process ends. On the other hand, 
if the save subject file includes metadata and binary 
15 data, the process proceeds to step S601. At step S601, a 
part enclosed with tags "<PH0T0>" and "< /PHOTO "is 
separated as metadata, thereby the binary data is 
extracted from the save subject file and written into 
the general area 504. 
20 Next, at step S602, an area having a size 

necessary for storing the metadata (the above separated 
metadata) of the save subject file is allocated from the 
metadata storage area. In the present embodiment, a 
storage area necessary for each metadata is sequentially 
25 allocated from the head of the metadata storage area. At 
this time, the size of the area file is reduced in 
correspondence with a size used for storage of metadata. 
12 



Then, at step S603, the metadata of the save subject 
file is written into the area allocated at step S602. 
Note that the details of the metadata storage processing 
at steps S602 and S603 will be described later with 
5 reference to the flowchart of Fig. 8. At step S604, a 
pointer to refer to the binary data from the metadata is 
set, and the process ends. Note that the processing at 
step S604 will be described later with reference to the 
flowchart of Fig. 9. 
10 Fig. 8 is a flowchart showing processing to 

allocate an area for storing metadata from the metadata 
storage area and store the metadata in the area. 

First, at step S701, a storage start position L start 
and a storage end position L en d of area file on the 
15 storage medium are obtained. In the present embodiment, 
the positions L sta rt and L end are represented by sector 
number. Next, to use a part of the area file (a part 
from the header) for storing the metadata, the area file 
is deleted at step S702. Then at step S703, the metadata 
20 is stored from the storage start position L star t ° n the 
storage medium, and the process proceeds to step S7 04. 
At step S704, a next sector number to the storage end 
position is obtained as L' start . At step S705, an area 
file is newly generated with the position L ! start as the 
25 storage start position and the storage end position L end , 
and the process ends . 

By the above processing, the metadata is stored 
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from the head of the area (corresponding to the metadata 
storage area 502 in the initial state) allocated by the 
area file, and the remaining area is newly allocated by 
the area file. 

5 Next, processing to establish linkage between the 

binary data stored in the general area 504 and the 
metadata (step S604) will be described. Fig. 9 is a 
flowchart showing the processing to establish linkage 
between binary data and metadata. Note that both data 
10 can be linked with each other by storing information 

specifying the binary data in the metadata, however, in 
the present embodiment, information to link the metadata 
with the binary data (link information) is stored in 1- 
sector area (fixed length area) following the metadata 
15 within the metadata storage area. In the present 

embodiment, the link information is a pointer which 
represents binary data to be linked by using a path and 
a file name. Note that the link information is not 
limited to that in the present embodiment but may be a 
20 head number of sector holding binary data to be linked. 

In a case where the metadata includes description 
of link information specifying its related binary data, 
such processing is unnecessary. However, if the link 
information is stored independently of the metadata as 
25 described above, linkage can be made in use of metadata 
which lacks description of link information, and the 
flexibility of the system can be improved. 

14 



At steps S801 and S802, as in the case of 
processing at steps S701 and S702, the storage start 
position L start and the storage end position L end of area 
file are obtained and then the area file is deleted. 
5 Then at step S803, a pointer, i.e., a path and a file 
name of related binary data are stored in a sector 
designated by the position L start . Then at step S804, the 
sector next to the position L start is set as L' sta rtf and 
at step S805, as in the case of step S705, an area file 
10 is generated by the positions L' start and L end , and the 
process ends. 

In this manner, as link information to refer to 
binary data from metadata is added to the metadata, it 
is possible to read only the metadata storage area to 
15 perform a search and extract necessary binary data. 

By the above-described processing, when one file 
including binary data and metadata is stored into a 
storage medium, the binary data can be stored into the 
general area 504 and the metadata, into the metadata 
20 storage area 503, respectively. 

Fig. 10 is a schematic view explaining the status 
of file stored in the storage medium according to the 
first embodiment. Fig. 10 shows stored two files (file 1 
and file 2) having binary data and metadata. As shown in 
25 Fig. 10, binary data 1001 of the file 1 and binary data 
1002 of the file 2 are stored in the general area. 
Metadata 1003 of the file 1, a pointer (link 



information) 1004 from the metadata to the binary data 
in the file 1, metadata 1005 of the file 2, a pointer 
(link information) 1006 from the metadata to the binary- 
data in the file 2, are stored in the metadata storage 

5 area. The remaining area is held as an area file 1007. 

As described above, as metadata is stored in a 
continuous area on a recording medium, only the metadata 
can be read at a high speed. Further, as a pointer {link 
information) to binary data related to the metadata is 

10 stored with the metadata, even in a case where the 

metadata lacks description of link information (file 
name or the like) , access to necessary binary data can 
be made . 

Further, in an ordinary relational database, it is 
15 necessary to store a path of search subject file and its 
metadata into the database. In this case, upon movement 
and/or deletion of file, the content of the data base 
must be updated. On the other hand, according to the 
present embodiment, as metadata and binary data are 
20 stored in one file such that a search for the metadata 
attached to the binary data is made, the above-described 
processing upon file movement and/or deletion can be 
omitted. 

Further, in the above embodiment, the link 
25 information is stored into the metadata storage area, 
however, it may be arranged such that all the link 
information are registered in a data base. 
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<Second Embodiments 

In the first embodiment, the method for high-speed 
access to metadata stored with binary data in one file 
5 has been described- As a second embodiment, a case where 
mutually related binary data and metadata are stored in 
different files will be described. Note that the system 
to realize data management described in the second 
embodiment has the same configuration as that of the 
10 first embodiment. 

In some file systems, one file cannot be 
divisionally stored. In such case, when binary data and 
its metadata are stored as different files, to access 
the metadata at a high speed, the metadata file is 
15 written into the metadata storage area and the binary 

data file is written into the general area, in a similar 
method to that of the first embodiment. 

Note that when the binary data and the metadata 
are stored in different files, the metadata file 
20 includes a pointer to the binary data to be referred to. 
Accordingly, it is not necessary to store a pointer to 
binary data in the metadata storage area as in the case 
of the first embodiment. 

As described above, even in a case where binary 
25 data and metadata are stored in different files in a 

file system where one file cannot be divisionally stored, 
as only the metadata is written into a pre-provided 



continuous area, the metadata can be accessed at a high 
speed. 

<Third Embodiment> 
5 In the above respective embodiments, XML is used 

as metadata description format. As a metadata standard 
using XML, DIG35 is known. It is possible to apply the 
DIG35 standard to metadata description in the 
constructions described in the first and second 
10 embodiments . 

The DIG35 standard is used for standardization of 
item and description method of still image metadata, and 
is characterized in that XML is used for describing 
metadata. Fig. 11 is an example of the content of 
15 metadata in Fig. 3 described in conformance with the 

DIG35 standard. First, a tag representing the start of 
metadata based on the DIG35 standard is described. For 
example, in the present embodiment, as shown in Fig. 11, 
described data enclosed with a start tag <METADATA> and 
20 an end tag </METADATA> can be determined as metadata. In 
this manner, as metadata is described in a predetermined 
structure using XML, environment-independent and highly- 
flexible data description can be made. 

The construction to save a binary file including 
25 metadata described based on the above DIG35 standard is 
as described in the first embodiment, therefore the 
explanation of the construction will be omitted, and a 
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part different from the first embodiment will be 
described. 

First, in a case where metadata in conformity with 
the DIG35 standard is used, the tags <METADATA> and 
5 </METADATA> are used in place of the tags <PHOTO> and 

</PHOTO> in the file storage processing in Fig. 7. That 
is, when it is determined at step S600 whether or not 
the save subject file includes metadata and binary data, 
the last 11 bytes of the file are extracted and it is 
10 examined whether or not the 11 bytes correspond with 
"</METADATA>" . If the save subject file includes 
metadata and binary data, the process proceeds to step 
S601, at which metadata enclosed with the tags 
<METADATA> and </METADATA> is separated from the save 
15 subject file, thereby the binary data is extracted and 
written into the general area 504. 

In the constructions described in the first and 
second embodiments, metadata in conformance with the 
DIG35 standard can be handled by the above change. 
20 As described above, according to the above 

respective embodiments, metadata is stored in a pre- 
allocated specialized area, thereby the metadata can be 
accessed at a high speed. Further, as link information 
to binary data is included on the metadata side, binary 
25 data related to the metadata can be easily extracted. 

Note that in the above respective embodiments, 
binary data is still image data, however, the binary 
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data is not limited to still image data. For example, as 
binary data, video data, sound data, music data and the 
like may be handled. 

Further, in the above first embodiment, XML is 
5 used for description of metadata, however, other data 
description languages such as HTML (Hypertext Markup 
Language) and SGML (Standard Generalized Markup 
Language) may be used, and further, TIFF tags may be 
used. 

10 Further, in the above respective embodiments, a 

magneto-optic disk is used as the storage medium 
corresponding of the recording unit 108, however, the 
storage medium is not limited to the magneto-optic disk. 
For example, a floppy disk, a memory card, a hard disk 

15 and the like may be used as the storage medium of the 
recording unit 108. 

Further, in the above first embodiment, one file 
where binary data and metadata are described is saved, 
and in the second embodiment, different files where 

20 binary data and metadata are respectively described are 
saved, however, it may be arranged such that one of 
processings described in the first and second 
embodiments is selected in accordance with type of save 
subject file. For example, as described in the first 

25 embodiment, it can be determined whether or not binary 
data and metadata are described in one file by 
extracting the last 8 bytes of file and examining 



whether or not the 8 bytes correspond with "</PHOTO>". 
If it is determined that binary data and metadata are 
described in one file, the processing in the first 
embodiment is performed, otherwise, the processing in 
5 the second embodiment is performed. In this case, 

various file formats can be handled in a flexible manner. 

The present invention can be applied to a system 
constituted by a plurality of devices (e.g., a host 
computer, an interface, a reader and a printer) or to an 
10 apparatus comprising a single device (e.g., a copy 
machine or a facsimile apparatus) . 

Further, the object of the present invention can 
be also achieved by providing a storage medium (or 
recording medium) storing software program code for 
15 performing the aforesaid processes to a system or an 
apparatus, reading the program code with a computer 
(e.g., CPU, MPU) of the system or apparatus from the 
storage medium, then executing the program. 

In this case, the program code read from the 
20 storage medium realizes the functions according to the 
embodiments, and the storage medium storing the program 
code constitutes the invention. 

Further, the storage medium, such as a floppy disk, 
a hard disk, an optical disk, a magneto-optical disk, a 
25 CD-ROM, a CD-R, a DVD, a magnetic tape, a non-volatile 
type memory card, and ROM can be used for providing the 
program code. 
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Furthermore, besides aforesaid functions according 
to the above embodiments are realized by executing the 
program code which is read by a computer, the present 
invention includes a case where an OS (operating system) 
5 or the like working on the computer performs a part or 
entire processes in accordance with designations of the 
program code and realizes functions according to the 
above embodiments. 

Furthermore, the present invention also includes a 
10 case where, after the program code read from the storage 
medium is written in a function expansion card which is 
inserted into the computer or in a memory provided in a 
function expansion unit which is connected to the 
computer, CPU or the like contained in the function 
15 expansion card or unit performs a part or entire process 
in accordance with designations of the program code and 
realizes functions of the above embodiments. 

As described above, according to the present 
invention, metadata of binary data as a search subject 
20 can be accessed at a high speed. 

As many apparently widely different embodiments of 
the present invention can be made without departing from 
the spirit and scope thereof, it is to be understood 
that the invention is not limited to the specific 
25 embodiments thereof except as defined in the appended 
claims . 
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